Phoneme Segmentation of Tamil Speech Signals Using Spectral Transition Measure
نویسنده
چکیده
Process of identifying the end points of the acoustic units of the speech signal is called speech segmentation. Speech recognition systems can be designed using sub-word unit like phoneme. A Phoneme is the smallest unit of the language. It is context dependent and tedious to find the boundary. Automated phoneme segmentation is carried in researches using Short term Energy, Convex hull, Formant, Spectral Transition Measure (STM), Group Delay Functions, Bayesian Information Criterion, etc. In this research work, STM is used to find the phoneme boundary of Tamil speech utterances. Tamil spoken word dataset was prepared with 30 words uttered by 4 native speakers with a high quality microphone. The performance of the segmentation is analysed and results are presented.
منابع مشابه
Grapheme Segmentation of Tamil Speech Signals using Excitation Information with MFCC and LPCC Features
The major components of automatic Speech Recognition(ASR)are the pronunciation dictionary, language models, acoustic model and decoder. The Pronunciation dictionaries define the mapping between the words and basic sounds of a language and thus play a vital role in speech recognition systems. Construction of the pronunciation dictionary is expensive and time consuming since it requires the knowl...
متن کاملDesign of language models at various phases of Tamil speech recognition system
This paper describes the use of language models in various phases of Tamil speech recognition system for improving its performance. In this work, the language models are applied at various levels of speech recognition such as segmentation phase, recognition phase and the syllable and word level error correction phase. The speech signals were segmented at phonetic level based on their acoustic c...
متن کاملUnsupervised Phoneme Segmentation Using Transformed Cepstrum Features
One of the basic problems in speech engineering is phoneme segmentation, that is, to divide a speech stream into a string of phonemes. Automatic Speech Recognition (ASR) models often require reliable phoneme segmentation in the initial training phase, and Text-to-Speech (TTS) systems need a large speech database with correct phoneme segmentation information for improving the performance. Human ...
متن کاملDynamic programming based segmentation approach to LSF matrix reconstruction
We propose a methodology of speech segmentation in which the LSF feature vector matrix of a segment is reconstructed optimally using a set of parametric/non-parametric functions. We have explored approximations using basis functions or polynomials. We have analyzed the performance of these methods w.r.t. phoneme segmentation (on 100 TIMIT sentences) and reconstruction error based on spectral di...
متن کاملStatistical corpus-based speech segmentation
An automatic speech segmentation technique is presented that is based on the alignment of a target speech signal with a set of different reference speech signals generated by a specific designed corpus-based speech synthesis system that additionally generates phoneme boundary markers. Each reference signal is then warped to the target speech signal. By synthesizing and warping many different re...
متن کامل